InternViT-300M-448px-V2_5
MIT
InternViT-300M-448px-V2_5 is a major upgrade of InternViT-300M-448px. It strengthens visual feature extraction through ViT incremental learning with a next-token prediction (NTP) loss, and performs particularly well on multilingual OCR data and on complex inputs such as mathematical charts.
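
For readers unfamiliar with the NTP loss mentioned above, the following is a minimal sketch of a generic next-token prediction loss: the mean cross-entropy between each position's predicted distribution and the token that actually follows. It is an illustration only, not the training code used for this model; the function name `ntp_loss` and the list-of-lists input format are assumptions made for the example.

```python
import math


def ntp_loss(logits, token_ids):
    """Mean next-token cross-entropy (hypothetical helper, illustration only).

    logits[t] holds the unnormalized scores produced after reading
    token_ids[: t + 1]; the prediction target for that row is token_ids[t + 1].
    The last row has no target and is ignored.
    """
    if len(logits) != len(token_ids):
        raise ValueError("need one row of logits per input token")
    if len(token_ids) < 2:
        raise ValueError("need at least two tokens to form a target")

    total = 0.0
    for t in range(len(token_ids) - 1):
        row = logits[t]
        target = token_ids[t + 1]
        # Numerically stable log-sum-exp for the softmax normalizer.
        row_max = max(row)
        log_z = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        # Negative log-probability of the true next token.
        total += log_z - row[target]
    return total / (len(token_ids) - 1)
```

With uniform logits over a vocabulary of size V, every next token has probability 1/V, so the loss equals log V; as the model becomes more confident in the correct next token, the loss approaches zero.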